Evaluating the Quality of Multilingual Items Generated Using Automatic Processes: Preliminary Results from a Reliability Study

Authors

Karen Fung

Mark J. Gierl

Hollis Lai

Abstract

Introduction

The traditional approach to test item translation is an effortful, time-consuming process conducted by bilingual or multilingual content specialists. One important problem that arises when content specialists perform translations is the introduction of subjectivity into the process. For instance, Hambleton (1993) reported that when a content specialist is told that another specialist will back-translate his or her work, the content specialist selects words that are more likely to be translated back into the original item. Of course, translations can also be evaluated using other methods, such as administering the original and the translated test items to a group of specialists who are fluent in both languages. However, other issues may arise, such as the possibility that the translators are stronger in one language than in the other (Hambleton, Merenda, & Spielberger, 2005).

As the use of technology in test development grows, automatic item generation (AIG; Gierl & Haladyna, 2013) with the use of item models (Gierl, Alves, & Zhou, 2008) is becoming more prominent. The translation of item models may make it possible to generate large numbers of items in two or more languages simultaneously. To date, the quality of such translations has yet to be evaluated. Hence, the purpose of our study is to begin this evaluative process by determining the similarity of the translated items when generative models are used to create items in multiple languages. We will also attempt to take advantage of the available translation technology by using the online engine Google Translate.

With the growing demand for computer-based testing and computer adaptive testing, a large number of items are now required to permit flexible test administration while maintaining adequate item security. When more items are needed for this item banking model, the traditional approach to item development becomes effortful, time-consuming, and costly. Traditionally, items are handcrafted by content specialists one by one to ensure that specific content areas are covered at appropriate skill levels, but this process becomes expensive as more items are needed. AIG saves time and money by combining content expertise with computer technology to create new items automatically. The development of multiple-choice items in the medical context using AIG was recently described by Gierl, Lai, and Turner (2012) using a three-step process. In the first step, content specialists create a graphical representation known as a cognitive …

Similar resources

Improvement of the Reliability of Automatic Manufacture Systems by Using FTA Technique

In recent years, many manufacturing industries have turned to automatic manufacturing systems to improve their efficiency. The expansion and growing complexity of these systems make it more necessary than ever to study their functional quality and to use reliable equipment. In this direction, the technique of fault tree analysis (FTA), along w...

Reliability and Performance of SEVQUAL Survey in Evaluating Quality of Medical Education Services

Background and Objectives: Given the importance of medical education quality in achieving a healthy community, valid and reliable tools are needed to measure the quality of medical education services efficiently. SEVQUAL is a popular service quality measurement framework used to assess quality in various service sectors. The purpose of this study was to...

Psychometric Properties of a Persian Version of the Specialty Indecision Scale: A Preliminary Study

Introduction: Diagnosis and management of specialty choice indecision is an important part of career guidance and support for medical students. Determining causes of indecision and resolving them helps students to make an optimum decision. The aim of this study was to determine the psychometric properties of a Persian version of the specialty indecision scale as an on-line questionnaire for med...

Translation and adaptation of "Checklist of pragmatic behaviors" to Farsi: a preliminary study

Objective: Pragmatic skills play a significant role in social interaction and strongly influence future academic achievement. Early pragmatic assessment enables early intervention, which is why pragmatic assessment tools for children are necessary. In this regard, there is a need for an observational tool for pragmatic studies at the preschool level in Iran. The Checklist...

On the Validation of a Preliminary Model of Reading Strategy Using SEM: Evidence From Iranian ELT Postgraduate Students

The present study was an attempt to refine a qualitatively proposed model of ELT discipline-specific reading strategies to provide a better interpretation of qualitative findings. Hence, in line with the components of the previous model, that is, 6 factors and 32 categories, a questionnaire with 6 hypothetical factors and 33 items was considered in the design of the ELT discipline-specific question...

Publication date: 2013
